Eliminating Duplicates under Interval and Fuzzy Uncertainty: An Asymptotically Optimal Algorithm and Its Geospatial Applications
نویسندگان
چکیده
Geospatial databases generally consist of measurements related to points (or pixels in the case of raster data), lines, and polygons. In recent years, the size and complexity of these databases have increased signiicantly and they often contain duplicate records, i.e., two or more close records representing the same measurement result. In this paper, we address the problem of detecting duplicates in a database consisting of point measurements. As a test case, we use a database of measurements of anomalies in the Earth's gravity eld that we have compiled. In this paper, we show that a natural duplicate deletion algorithm requires (in the worst case) quadratic time, and we propose a new asymptotically optimal O(n log(n)) algorithm. These algorithms have been successfully applied to gravity databases. We believe that they will prove to be useful when dealing with many other types of point data. Geospatial databases: general description. In many application areas, researchers and practitioners have collected a large amount of geospatial data. 1 For example, geophysicists measure values d of the gravity and magnetic elds, elevation, and reeectivity of electromagnetic energy for a broad range of wavelengths (visible, infrared, and radar) at diierent geographical points (x; y); see, e.g., 35]. Each type of data is usually stored in a large geospatial database that contains corresponding records (x i ; y i ; d i). Based on these measurements, geophysicists generate maps and images and derive geophysical models that t these measurements. Gravity measurements: case study. In particular, gravity measurements are one of the most important sources of information about subsurface structure and physical conditions. There are two reasons for this importance. First, in contrast to more widely used geophysical data like remote sensing images, that mainly reeect the conditions of the Earth's surface, gravitation comes from the whole Earth (e.g., 19, 20]). Thus gravity data contain valuable information about much deeper geophysical structures. Second, in contrast to many types of geophysical data, which usually cover a reasonably local area, gravity measurements cover broad areas and thus provide important regional information. The accumulated gravity measurement data are stored at several research centers around the world. One of these data storage centers is located at the University of Texas at El Paso (UTEP). This center contains gravity measurements collected throughout the United States and Mexico and parts of Africa. The geophysical use of gravity database compiled at UTEP is illustrated for Duplicates: …
منابع مشابه
The Use of Fuzzy Measures as a Data Fusion Tool in Geographic Information Systems: Case Study
Geospatial databases generally consist of measurements related to points (or pixels in the case of raster data), lines, and polygons. In recent years, the size and complexity of these databases have increased significantly and they often contain duplicate records, i.e., two or more close records representing the same measurement result. In this paper, we use fuzzy measures to address the proble...
متن کاملUNCERTAINTY DATA CREATING INTERVAL-VALUED FUZZY RELATION IN DECISION MAKING MODEL WITH GENERAL PREFERENCE STRUCTURE
The paper introduces a new approach to preference structure, where from a weak preference relation derive the following relations:strict preference, indifference and incomparability, which by aggregations and negations are created and examined. We decomposing a preference relation into a strict preference, anindifference, and an incomparability relation.This approach allows one to quantify diff...
متن کاملAn Assessment Method for Project Cash Flow under Interval-Valued Fuzzy Environment
Effective project management requires reliable knowledge of cash required in different stages of project life cycle. Getting this knowledge is highly dependent on sophisticated consideration of project environment. Nature of projects and their environments are associated with uncertain conditions. In this paper, a new project cash flow assessment method based on project scheduling is proposed t...
متن کاملA Multi-Criteria Analysis Model under an Interval Type-2 Fuzzy Environment with an Application to Production Project Decision Problems
Using Multi-Criteria Decision-Making (MCDM) to solve complicated decisions often includes uncertainty, which could be tackled by utilizing the fuzzy sets theory. Type-2 fuzzy sets consider more uncertainty than type-1 fuzzy sets. These fuzzy sets provide more degrees of freedom to illustrate the uncertainty and fuzziness in real-world production projects. In this paper, a new multi-criteria ana...
متن کاملSHAPLEY FUNCTION BASED INTERVAL-VALUED INTUITIONISTIC FUZZY VIKOR TECHNIQUE FOR CORRELATIVE MULTI-CRITERIA DECISION MAKING PROBLEMS
Interval-valued intuitionistic fuzzy set (IVIFS) has developed to cope with the uncertainty of imprecise human thinking. In the present communication, new entropy and similarity measures for IVIFSs based on exponential function are presented and compared with the existing measures. Numerical results reveal that the proposed information measures attain the higher association with the existing me...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Reliable Computing
دوره 10 شماره
صفحات -
تاریخ انتشار 2004